Corpus: hif_wikipedia_2018_10K

5.2.18 Words nearly always together in sentences

Strong sentence co-occurrences with a low probability of being separated

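The quotient below is calculated as freq(word1)*freq(word2)/together_freq^2. It is never below 1, and a value close to 1 means the two words almost never occur in a sentence without each other.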

Word 1    Word 2    Frequency of word 1  Frequency of word 2  Frequency together  Quotient
India     state     1161                 1004                 907                 1.42
India     division  1161                 911                  877                 1.38
ka        district  1092                 1064                 886                 1.48
ka        gaon      1092                 1030                 876                 1.47
ka        state     1092                 1004                 875                 1.43
ka        division  1092                 911                  873                 1.31
district  ka        1064                 1092                 886                 1.48
district  gaon      1064                 1030                 1012                1.07
district  state     1064                 1004                 879                 1.38
district  division  1064                 911                  879                 1.25
gaon      ka        1030                 1092                 876                 1.47
gaon      district  1030                 1064                 1012                1.07
gaon      state     1030                 1004                 876                 1.35
gaon      division  1030                 911                  876                 1.22
state     India     1004                 1161                 907                 1.42
state     ka        1004                 1092                 875                 1.43
state     district  1004                 1064                 879                 1.38
state     gaon      1004                 1030                 876                 1.35
state     division  1004                 911                  877                 1.19
the       acting    1001                 772                  770                 1.30
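
The quotient column can be reproduced from the three frequency columns. The following is a minimal sketch, assuming the quotient is the product of the two individual word frequencies divided by the squared joint frequency, which is what the values in the table agree with. The function name `separation_quotient` is hypothetical and used only for illustration; the sample rows are copied from the table.

```python
def separation_quotient(freq1: int, freq2: int, together: int) -> float:
    """Return freq1 * freq2 / together**2 (illustrative helper, not a library call)."""
    if together <= 0:
        raise ValueError("joint frequency must be positive")
    return freq1 * freq2 / together ** 2


# (word 1, word 2, frequency of word 1, frequency of word 2, frequency together)
rows = [
    ("India", "state", 1161, 1004, 907),
    ("district", "gaon", 1064, 1030, 1012),
    ("the", "acting", 1001, 772, 770),
]

for word1, word2, f1, f2, both in rows:
    print(f"{word1} {word2} {separation_quotient(f1, f2, both):.2f}")
# India state 1.42
# district gaon 1.07
# the acting 1.30
```

Because the joint frequency can never exceed either individual frequency, the result is at least 1, and it equals 1 only when both words occur exclusively together.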
55 msec needed at 2019-03-22 12:05